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Abstract. The inspirals of stellar-mass compact objects into supermassive black 
holes are some of the most important sources for LISA. Detection techniques based on 
fully coherent matched filtering have been shown to be computationally intractable. 
We describe an efficient and robust detection method that utilizes the time-frequency 
evolution of such systems. We show that a typical extreme mass ratio inspiral (EMRI) 
source could possibly be detected at distances of up to ~ 2 Gpc, which would mean ~ 
tens of EMRI sources can be detected per year using this technique. We discuss the 
feasibility of using this method as a first step in a hierarchical search. 
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1. Introduction 



Astronomical observations indicate that many galaxies host a supermassive black hole 
(SMBH) in their centre. The inspirals of stellar-mass compact objects into such SMBHs 
with mass M ~ few x 1O 5 M -1O 7 M constitute one of the most important gravitational 
wave (GW) sources for the planned space-based GW observatory LISA. Preliminary 
results indicate that the LISA EMRI detection rate will most likely be dominated 
by inspirals of ~ 10 M BHs into ~ 1O 6 M SMBHs. The EMRI detection rate could 
be as many as ~ 1000 in 3-5 years within ~ 3.5 Gpc. 

The strain amplitude of GWs from EMRIs can be estimated using the Newtonian 
quadrupole approximation to the Einstein field equations, 

where / is the orbital frequency, d is the distance of the source from the Earth and 
/i = mM/ (m + M) is the reduced mass. This can be compared with the characteristic 
noise strain of ~ 5 x 10~ 21 at the floor of the LISA noise curve near 5 mHz For 
a 10 + 1O 6 M EMRI system at 1 Gpc, the instantaneous signal-to-noise ratio (SNR) 
Pt is at best around 0.1. Detection of GWs from EMRIs therefore depends on (semi-) 
coherent accumulation of the signal with time. 

The optimal method to detect a known time series signal h(t) embedded in 
stationary Gaussian noise n(t) is matched filtering. In that technique, we search for 
the maximum correlation of the Fourier components of the data with that of the known 
waveforms, weighted by the noise variance. The optimal SNR, pm, can be written as 

P 2 m = H% ( 2 ) 

k=i ™fc 

where hk is the Fourier amplitude of the signal, o\ h = 0.5Sh(f)/ (dt 2 df) is the expected 
variance of the noise component rik at frequency bin k, characterized by Sh(f), the 
strain spectral density of the noise, N is the number of Fourier frequency bins and 
df is the bin width. The SNR squared is therefore effectively proportional to the 
product of the number of wave cycles with the instantaneous SNR squared. During an 
integration over the lifetime of LISA (~ 3-5 yrs), the number of GW cycles observed, 
Nqw ~ Tf ~5 x 10 5 , so the optimal SNR can be as high as pu ~ 100 at 1 Gpc. 



2. Computational challenges of EMRI detection 

EMRI waveforms are complex and are characterized by many frequency components, 
which arise from several effects. First, typical EMRI orbits are expected to be still 
moderately eccentric, e ~ 0-0.5, during the last several years of inspiral when LISA 
can detect them [3j — [HI - At such moderate eccentricities, there can be as many as five 
harmonics of the orbital frequency contributing significantly (> 10%) to the observed 
SNR [T. In addition, EMRI signals exhibit many modulations, caused by periastron 
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precession, spin induced precession of the orbital plane and yearly amplitude and 
Doppler modulation due to the motion of LISA around the sun. Finally, the frequency 
components in an EMRI signal exhibit significant evolution over a LISA observation. 
For a 3 year observation of a signal with central frequency ~ 5 mHz, the signal power 
can be spread over as many as 10 5 frequency bins [3J. This hinders the detection of the 
signals using simple Fourier spectrum analysis. 

The complexity of the EMRI waveforms makes a fully coherent matched filtering 
search computationally impossible. Rough estimates would suggest that ~ 10 40 
templates are needed for a fully coherent search pQ . Extrapolating to the time of the 
LISA mission, it is reasonable to assume ~ 50 Tflops of available computing power for 
the search, but this allows only ~ 10 12 templates to be searched in real time. Alternative 
methods are therefore required to detect EMRIs, such as semi-coherent hierarchical 
searches jTj. 



3. A time-frequency detection method 

We describe an efficient and robust strategy to detect GWs from EMRIs by accumulating 
the signal power in the time-frequency (t-f ) domain. The t-f power spectrum is produced 
by dividing the data into 2 week long segments and carrying out a Fast Fourier Transform 
(FFT) on each. In the semi-coherent matched filtering search pQ, the waveform is also 
divided into sections, of ~ 3 weeks. In that case, this is the longest segment length 
that computational constraints will allow. In the time-frequency analysis, there are no 
such computational limits, but we choose a 2-week duration to ensure enough time and 
frequency resolution to trace the frequency evolution of EMRIs with time. The power 
spectrum is defined for each segment i and frequency bin k as, 



p , - , ■ mj + nj)\ 2 _ 2(/4) 2 A Re[KKY] 2K) 2 

f\l,K)- — 2 - — - 2 h4 — V — - . [6) 
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We then calculate the power "density", p(i, k), by computing the average power within 
a rectangular box centered at each point (i, k), 

n/2 1/2 

p(i,k)= S P(i + a,k + b)/m, (4) 

a=-n/2 b=-l/2 

where n, I are the lengths of the box in the time and frequency dimension respectively 
and m = n x I is the number of data points in the box. The SNR at each point 
is then p s = (p — p)/o~ p , where p is the mean of p calculated in the entire t-f plane 
and a 2 is the expected variance of p for pure noise. In practice, we use the variance of 
the calculated p in the entire t-f plane. The detection process involves finding the local 
maximum p s or tracks of "excess" p s . 

If the data consist of only stationary Gaussian noise, mp will follow a X2m 
distribution, with expected a p = 2/y/m, i.e., the larger the box, the smoother the 
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noise power density in the t-f plane. For a given box size, the false alarm probability 
(FAP) for finding at least one point with p s above a certain threshold po is 



where Q x 2 (P) is the cumulative distribution function for the xlm distribution. We 
estimate Nf ~ N/(m/A) for the number of independent data points searched. 

To search for a possible signal, we vary the box lengths n and I until the maximum 
(or a significant) p s is found. The optimal box size should be large enough to contain 
most of the signal power but small enough to exclude most of the noise contribution. The 
overall probability of finding a FAP m below some threshold FAP depends on the number 
of independent trials of different box sizes. A Monte Carlo simulation is in progress to 
determine the statistics of this method and to compute appropriate thresholds. In the 
present work, the FAP of the search is based on a simple case where we increase the 
box dimensions by factors of two, one side at a time, and the overall FAP is estimated 
as FAP m multiplied by the number of boxsizes searched. In this paper, significant 
detections are defined as those such that the overall FAP of the search is < 10~ 2 . 

Like many other time-frequency signal processing methods, this method examines 
the statistics of the presence of a lot of high power in a region. Our method is in 
particular similar to the "excess power" method jH], as both use the summation of 
powers within a certain time and frequency interval. The excess power method was 
designed to detect bursting waveforms. Our approach applies to the detection of both 
burst-like and continuous waves since it helps to map out the structure of the excess 
power density. This structure can then be detected by finding the local maximum or 
using pattern-recognition methods. 

4. Simulated EMRI waveform 

To test this approach, we tried to detect an EMRI signal in simulated data. Accurate 
inspiral waveforms are not yet available, so we made use of approximate numerical 
waveforms, as described in [TJ |5J ED] . We considered a "typical" EMRI event — the 
inspiral of a 1OM BH into a 1O 6 M SMBH, with eccentricity e = 0.4 and pericentre 
r p « 11M at the start of the observation, SMBH spin of a = 0.8M, orbital inclination 
angle of 45° (using the definition of inclination in 9J) and placed at distances of 0.5-2 
Gpc. We used data of total duration three years, sampled at a cadence of 8s. With 
these choices, the total number of data points analyzed was iV = 1.2 x 10 7 . The 
simulated data consist of two independent LISA data streams (the low frequency T and 
TT responses described in [2 ). The combined matched filtering SNR at a distance of 1 
Gpc is pm ~ 140 for the whole three years of data, and ~ 90 for the last year. We used 
the LISA noise response given in 0. 




(5) 
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5. Results and Discussion 

In Figures d and El we show the normalized power density p s in the time-frequency 
domain calculated with the "optimal" box size when the EMRI was at a distance of 0.5, 
1.0, 1.4, and 2 Gpc respectively. We also show the power distribution function and the 
pure noise theoretical expectation for comparison. 

At the distance of 0.5 and 1 Gpc, the evolution of the GW central frequency (and 
harmonics) with time is apparent to the eye in the time-frequency plane. The amplitude 
increases as the particle inspirals but the signal is also modulated by LISA's motion. 
At 0.5 Gpc, GWs from the last year of inspiral can be detected at SNR ~ 28, 19, and 
8, respectively at each of the three dominating frequency components. At the distance 
of 1.4 Gpc, the frequency evolution is visible over the last year and two frequency 
components are apparent. At a distance of 2 Gpc, the signal can possibly be detected 
with an SNR of ~ 7, and an overall FAP of ~ 2 x 10 -6 when searching through all 
independent trials. 

To assess the efficiency of this method, we show in Figure|3]an approximate Receiver 
Operator Characteristic (ROC) curve for this method. The ROC is shown for the sources 
at 1 Gpc, 1.4 Gpc and 2 Gpc discussed in the text, and also distances of 1.75 Gpc, 2.25 
Gpc, 2.5 Gpc and 3 Gpc for comparison. The ROC curves were computed by setting 
thresholds on p for each bin size and performing a preliminary Monte Carlo of ~ 20000 
noise realisations. The false alarm probability was computed as the fraction of pure noise 
realisations in which a threshold was exceeded for at least one bin size. The detection 
rate was the fraction of realisations of signal plus noise in which the maximum SNR 
exceeded the threshold for at least one bin size. The thresholds were set by fixing the 
FAP m defined by equation (jHJ) to be equal for all bin sizes, taking Nf = N/{m/A). 
Different choices of thresholds amount to distributing the overall FAP of the search 
between the various bins in different ways. The optimum threshold choice for a single 
source will be source dependent. Monte Carlo simulations are underway in order to 
optimise the threshold choice in the sense of giving the best performance. We see that 
the detection performance is very good up to 1.75 Gpc. At 2 Gpc, the detection rate 
is still in excess of 50% for an overall false alarm probability of a few percent. The 
source at 3 Gpc represents the absolute limit of this particular search, since that is the 
point at which this search ceases to do any better than a random one. This should be 
contrasted with the performance of the semi-coherent matched filtering technique pQ. 
An ROC curve is not available for that algorithm, but based on the results of Gair et 
al., at an overall false alarm probability of 1% the detection rate for this source at a 
distance of 2 Gpc would likely be close to 100%. However, as emphasised before, this 
improved performance comes at much higher computational cost. 

In conclusion, we have presented a proof of principle that a simple time-frequency 
method could be used to detect GWs from bright EMRIs. A typical EMRI source could 
possibly be detected with SNR > 6 at a distance up to ~ 2 Gpc using this method. The 
method is computationally efficient in the sense that it takes only minutes to finish a 
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search of EMRIs with one computer. Based on current estimates of the astrophysical 
rates [TJ |3] , tens of EMRIs could be detected each year by this technique. 

This method does not provide good parameter determination, but it could be used 
to detect the brightest sources as the first stage of a hierarchical search. The method 
provides some information about the frequency content and inspiral rate of an event 
which can be used to refine a subsequent matched filtering search. In practice, the EMRI 
detection problem will be made considerably more complicated by confusion with other 
sources in the LISA data, in particular confusion from white dwarf binaries. The time- 
frequency tracks of these other sources will look different to EMRIs. However, the tracks 
will overlap and a simple excess power method might not be able to distinguish multiple 
overlapping sources from one another. Further, in the current analysis, we have only 
considered a single 'typical' EMRI signal, but the frequency and frequency evolution 
of other EMRIs will be different, which will change the detection statistics. Finally, 
the approximate quadrupole waveforms used in this analysis lack some of the multipole 
structure that we expect from true inspirals, which will also change our conclusions. 
More detailed discussion of these issues will be provided in a follow-up paper [TT] . 
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Figure 1. Left - the t-f (normalised) power density for the optimal box size. Right 
- the distribution of power (circles) plus expected distribution for pure noise (solid 
line). The upper plots are for d = 0.5 Gpc (optimistically, we expect < 3 such events 
in three years). This could be detected at a FAP of < 10~ 16 and a maximal SNR of 
~ 28. The lower plots are for d = 1 Gpc (we expect < 25 events in three years) and 
have FAP< KT 16 and maximal SNR- 14. 
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Figure 2. As Figure^but for d — 1.4 Gpc (upper plots, expect < 60 events in three 
years, FAP< 10~ 10 and SNR maa; ~ 8) and d = 2 Gpc (lower plots, expect < 180 events 
in three years, FAP- 2 x 10~ 6 and SNR ma:E ~ 7). 
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Figure 3. Approximate ROC curve for this method. The detection rate is shown as a 
function of the overall false alarm probability of the search, when the source is placed 
at distances of 1, 1.4, 1.75, 2, 2.25, 2.5 and 3 Gpc from the detector. The performance 
of a random search, for which the false alarm rate equals the detection rate, is shown 
for comparison. 



